Optimal hyper-scalable load balancing with a strict queue limit
نویسندگان
چکیده
Load balancing plays a critical role in efficiently dispatching jobs parallel-server systems such as cloud networks and data centers. A fundamental challenge the design of load algorithms is to achieve an optimal trade-off between delay performance implementation overhead (e.g. communication or memory usage). This has primarily been studied so far from angle amount required asymptotically performance, particularly vanishing large-scale systems. In contrast, present paper, we focus on arbitrarily sparse budget, possibly well below minimum requirement for delay, referred hyper-scalable operating region. Furthermore, may only be admitted when specific limit queue position job can guaranteed. The centerpiece our analysis universal upper bound achievable throughput any dispatcher-driven algorithm given budget limit. We also propose scheme which operate at message rate enforce limit, while allowing server states captured via closed product-form network, servers act customers traversing various nodes. distribution leveraged prove that tight proposed throughput-optimal many-server regime constraints. Extensive simulation experiments are conducted illustrate results.
منابع مشابه
Scalable Load-Distance Balancing
We introduce the problem of load-distance balancing in assigning users of a delay-sensitive networked application to servers. We model the service delay experienced by a user as a sum of a network-incurred delay, which depends on its network distance from the server, and a server-incurred delay, stemming from the load on the server. The problem is to minimize the maximum service delay among all...
متن کاملScalable Load Balancing
In this paper, we develop load balancing strategies for scalable high-performance parallel A* algorithms suitable for distributed-memory machines. In parallel A* search, ineeciencies such as processor starvation and search of nonessential spaces (search spaces not explored by the sequential algorithm) grow with the number of processors P used, thus restricting its scalability. To alleviate this...
متن کاملJoin-Idle-Queue: A novel load balancing algorithm for dynamically scalable web services
The prevalence of dynamic-content web services, exemplified by search and online social networking, has motivated an increasingly wide web-facing front end. Horizontal scaling in the Cloud is favored for its elasticity, and distributed design of load balancers is highly desirable. Existing algorithms with a centralized design, such as Join-the-Shortest-Queue (JSQ), incur high communication over...
متن کاملLocally Optimal Load Balancing
This work studies distributed algorithms for locally optimal load-balancing : We are given a graph of maximum degree ∆, and each node has up to L units of load. The task is to distribute the load more evenly so that the loads of adjacent nodes differ by at most 1. If the graph is a path (∆ = 2), it is easy to solve the fractional version of the problem in O(L) communication rounds, independentl...
متن کاملSparse matrix decomposition with optimal load balancing
Optimal load balancing in sparse matrix decomposition without disturbing the row/column ordering is investigated. Both asymptotically and run-time efficient exact algorithms are proposed and implemented for one-dimensional (1D) striping and two-dimensional (2D) jagged partitioning. Binary search method is successfully adopted to 1D striped decomposition by deriving and exploiting a good upper b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Performance Evaluation
سال: 2021
ISSN: ['0166-5316', '1872-745X']
DOI: https://doi.org/10.1016/j.peva.2021.102217